HYBRID PROTEINS WHICH FORM HETERODIMERS

FIELD OF THE INVENTION 

The present invention relates to a hybrid protein 
comprising two coexpressed amino acid sequences forming a 
dimer, each comprising:

a) at least one amino acid sequence selected from a 
homomeric receptor, a chain of a heteromeric receptor, a 
ligand, and fragments thereof; and 

b) a subunit of a heterodimeric proteinaceous hormone 
or fragments thereof; in which (a) and (b) are bonded directly 
or through a peptide linker, and, in each couple, the two 
subunits (b) are different and capable of aggregating to form a 
dimer complex. 

BACKGROUND OF THE INVENTION 

Protein-protein interactions are essential to the 
normal physiological functions of cells and multicellular 
organisms. Many proteins in nature exhibit novel or optimal 
functions when complexed with one or more other protein chains. 
This is illustrated by various ligand-receptor combinations
that contribute to regulation of cellular activity. Certain
ligands, such as tumor necrosis factor α (TNFα), TNFβ, or
human chorionic gonadotropin (hCG), occur as multi-subunit
complexes. Some of these complexes contain multiple copies of
the same subunit. TNFα and TNFβ (collectively referred to
hereafter as TNF) are homotrimers formed by three identical
subunits (1-4). Other ligands are composed of non-identical
subunits. For example, hCG is a heterodimer (5-7). Receptors
may also occur or function as multi-chain complexes. For
example, receptors for TNF transduce a signal after being
aggregated to form dimers (8,9). Ligands to these receptors
promote aggregation of two or three receptor chains, thereby
affording a mechanism of receptor activation. For example,
TNF-mediated aggregation activates TNF receptors (10-12).

The modulation of protein-protein interactions can be 
a useful mechanism for therapeutic intervention in various 
diseases and pathologies. Soluble binding proteins that can
interact with ligands can potentially sequester the ligand
away from the receptor, thereby reducing the activation of that
particular receptor pathway. Alternatively, sequestration of
the ligand may delay its elimination or degradation, thereby
increasing its duration of effect, and perhaps its apparent
activity in vivo. In the case of TNF, soluble TNF receptors
have been primarily associated with inhibition of TNF activity
(13-17).

Soluble binding proteins may be useful for treating
human diseases. For example, soluble TNF receptors have been
shown to have efficacy in animal models of arthritis (18,19).

Since TNF has three binding sites for its receptor
(10-12), and dimerization of the cell surface receptor is
sufficient for bioactivity (8,9), it is likely that binding of
a single soluble receptor to TNF will leave open the
possibility that this 1:3 complex of soluble receptor:TNF
(trimer) can still bind and activate a pair of cell surface TNF
receptors. To achieve an inhibitory effect, it would be
expected that two of the receptor binding sites on the TNF
trimer must be occupied or blocked by the soluble binding
protein. Alternatively, the binding protein could block proper
orientation of TNF at the cell surface.
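
The counting argument above can be illustrated with a minimal
sketch. It assumes, as the text does, three receptor-binding
sites per TNF trimer and that two free sites suffice to
aggregate a pair of cell surface receptors. The constant and
function names are hypothetical, and the model ignores
affinity, kinetics, and steric effects.

```python
# Illustrative counting model only; not a description of measured binding.
TNF_SITES = 3          # receptor-binding sites per TNF trimer (refs 10-12)
SITES_TO_ACTIVATE = 2  # receptor dimerization suffices for signaling (refs 8,9)


def can_still_activate(blocked_sites: int) -> bool:
    """Return True if enough free sites remain to aggregate two receptors."""
    if not 0 <= blocked_sites <= TNF_SITES:
        raise ValueError("blocked_sites must be between 0 and 3")
    return TNF_SITES - blocked_sites >= SITES_TO_ACTIVATE


if __name__ == "__main__":
    cases = [
        ("no soluble receptor", 0),
        ("one monomeric soluble receptor", 1),
        ("one dimeric soluble receptor", 2),
    ]
    for label, blocked in cases:
        print(f"{label}: activation possible = {can_still_activate(blocked)}")
```

Under these assumptions a single bound monomer leaves two sites
free, whereas a binding protein that occupies two sites leaves
only one.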

Generally speaking, a need has been felt for synthesizing
proteins that contain two receptor (or ligand) chains, as
dimeric hybrid proteins. See Wallach et al., U.S. Patent
5,478,925.

The primary strategy employed for generating dimeric
or multimeric hybrid proteins, containing binding domains from
extracellular receptors, has been to fuse these proteins to the
constant regions of an antibody heavy chain.

This strategy led, for example, to the construction
of CD4 immunoadhesins (20). These are hybrid molecules
consisting of the first two (or all four) immunoglobulin-like
domains of CD4 fused to the constant region of antibody heavy
and light chains. This strategy for creating hybrid molecules
was adapted to the receptors for TNF (10,16,21) and led to the
generation of constructs with higher in vitro activity than
the monomeric soluble binding proteins.

It is widely held that the higher in vitro potency
of the dimeric fusion proteins should translate into higher in
vivo activity. One study does support this, revealing an at
least 50-fold higher activity for a p75(TBP2)-Ig fusion protein
in protecting mice from the consequences of intravenous LPS
injection (16).

However, despite the widespread utilization of
immunoglobulin fusion proteins, this strategy has several
drawbacks. One is that certain immunoglobulin Fc domains
participate in effector functions of the immune system. These
functions may be undesirable in a particular therapeutic
setting (22).

A second limitation pertains to the special cases
where it is desirable to produce heteromeric fusion proteins,
for example soluble analogs of the heteromeric IL-6 or type I
interferon receptors. Although there are numerous methods for
producing bifunctional antibodies (e.g., by co-transfection or
hybridoma fusions), the efficiency of synthesis is greatly
compromised by the mixture of homodimers and heterodimers that
typically results (23). Recently there have been several
reports describing the use of leucine zipper motifs to guide
assembly of heterodimers (24-26). This appears to be a
promising approach for research purposes, but the non-native or
intracellular sequences employed may not be suitable for
chronic applications in the clinic due to antigenicity. The
efficiency of assembly and stability post assembly may also be
limitations.

On the other hand, in the particular case of TNF
receptors, certain modifications to the p55 TNF receptor have
been found to facilitate homodimerization and signaling in the
absence of ligand (27,28). It has been found that a
cytoplasmic region of the receptor, termed the "death domain,"
can act as a homodimerization motif (28,30). As an alternative
to an immunoglobulin hybrid protein, fusion of the
extracellular domain of the TNF receptor to its cytoplasmic
death domain could conceivably result in a secreted protein
which can dimerize in the absence of TNF. Such fusion proteins
have been disclosed and claimed in the International Patent
Application WO 95/31544.

A third strategy employed for generating
dimers of soluble TNF receptors has been to chemically
cross-link the monomeric proteins with polyethylene glycol (31).

SUMMARY OF THE INVENTION 

The present invention provides an alternative way of
obtaining such dimeric proteins, offering some important
advantages: it uses a natural heterodimeric
scaffold corresponding to a circulating non-immunoglobulin
protein with a long half-life. A preferred example is hCG, a
protein that is secreted well, has good stability, and has a
long half-life (32-33). Given hCG's prominent role as a marker
of pregnancy, many reagents have been developed to quantitate
and study the protein in vitro and in vivo. In addition,
hCG has been extensively studied using mutagenesis, and it is
known that small deletions to the protein, such as removal of
five residues at the extreme carboxyl-terminus of the α
subunit, can effectively eliminate its biological activity
while preserving its capability to form a heterodimer (34,35).
Small insertions, of up to 30 amino acids, have been shown to
be tolerated at the amino- and carboxyl-termini of the α
subunit (36), while fusion of the α subunit to the carboxyl
terminus of the β subunit also had little effect on
heterodimer formation (37).

An analog of hCG in which an immunoglobulin Fc domain
was fused to the C-terminus of the hCG β subunit has also been
reported; however, this construct was not secreted and no
effort was made to combine it with an α subunit (38).

Therefore, the main object of the present invention
is a hybrid protein comprising two coexpressed amino acid
sequences forming a dimer, each comprising:

a) at least one amino acid sequence selected from a
homomeric receptor, a chain of a heteromeric receptor, a
ligand, and fragments thereof; and

b) a subunit of a heterodimeric proteinaceous
hormone, or fragments thereof; in which (a) and (b) are bonded
directly or through a peptide linker, and in each couple the
two subunits (b) are different and capable of aggregating
to form a dimer complex.

According to the present invention, the linker may be
enzymatically cleavable.

Sequence (a) is preferably selected from: the
extracellular domain of the TNF Receptor 1 (55 kDa, also
called TBP1), the extracellular domain of the TNF Receptor 2
(75 kDa, also called TBP2), or fragments thereof still
containing the ligand binding domain; the extracellular domains
of the IL-6 receptors (also called gp80 and gp130); the
extracellular domain of the IFN α/β receptor or IFN γ
receptor; a gonadotropin receptor or its extracellular
fragments; antibody light chains, or fragments thereof,
optionally associated with the respective heavy chains;
antibody heavy chains, or fragments thereof, optionally
associated with the respective light chains; antibody Fab
domains; or ligand proteins, such as cytokines, growth factors
or hormones other than gonadotropins, specific examples of
which include IL-6, IFN-β, TPO, or fragments thereof.

Sequence (b) is preferably selected from an hCG, FSH,
LH, TSH, or inhibin subunit, or fragments thereof.

Modifications to the proteins, such as chemical or
protease cleavage of the protein backbone, or chemical or
enzymatic modification of certain amino acid side chains, can
be used to render the components of the hybrid protein of the
invention inactive. This restriction of activity may also be
accomplished through the use of recombinant DNA techniques to
alter the coding sequence for the hybrid protein in a way that
results directly in the restriction of activity to one
component, or that renders the protein more amenable to
subsequent chemical or enzymatic modification.

The above hybrid proteins will result in
monofunctional, bifunctional or multifunctional molecules,
depending on the amino acid sequences (a) that are combined
with (b). In each couple, (a) can be linked to the amino
termini or to the carboxy termini of (b), or to both.

A monoclonal hybrid protein of the present invention
can, for instance, comprise the extracellular domain of a
gonadotropin receptor linked to one of the corresponding
receptor-binding gonadotropin subunits. According to such an
embodiment, the hybrid protein of the invention can be a
molecule in which, for example, the FSH receptor extracellular
domain is linked to FSH to increase plasma half-life and
improve biological activity.

This preparation can be employed to induce follicular
maturation in assisted reproduction methods, such as ovulation
induction or in vitro fertilisation, and to serve as a means
to dramatically amplify the biological activity of the hormone
essential for the success of the process, thus reducing the
requirement for both the hormone itself and the number of
injections to achieve ovulation.

The FSH receptor and the production of the 
extracellular domain of the human FSH receptor have been 
described respectively in WO 92/16620 and WO 96/38575. 

According to a particular embodiment, the
extracellular domain of the FSH receptor (ECD) can be fused in
frame with a peptide linker that contains the thrombin
recognition/cleavage site (29) and represents a "tethered" arm.
The peptide linker links the extracellular domain of the FSH
receptor with an FSH subunit. This will allow for removal of
the extracellular domain of the FSH receptor by cleavage at the
thrombin cleavage site as the molecule comes in contact with
thrombin in the systemic circulation.

In another embodiment, instead of the thrombin
cleavage site, an enzyme recognition site for an enzyme that is
found in greatest abundance in the ovary is used. In this way,
as the ECD-FSH molecule travels to the ovary, it will be
exposed to enzymes found in the highest concentrations in that
tissue and the ECD will be removed so that the FSH can interact
with the membrane bound receptor.

In yet another embodiment, instead of an enzyme
recognition site, a flexible hinge region is cloned between ECD
and FSH so that the ECD will not be enzymatically removed from
the hormone. In this way, when the ECD-FSH molecule arrives at
the ovary, a competition will be established between the
hinge-attached ECD and the ECD of the FSH receptor found on the
ovarian cell membrane.

In a further preferred embodiment of the invention,
the hybrid protein consists of an aggregate of a pair
of amino acid sequences, one of which contains TBP1 (or the
fragments from aa 20 to aa 161 or to aa 190) as (a) and the α
subunit of hCG as (b), and the other of which also contains
TBP1 (or the same fragments as above) as (a) and the β subunit
of hCG, or fragments thereof, as (b). According to this
embodiment, depending on the particular sequence that is chosen
as (b) (the entire β subunit of hCG, or fragments or
modifications thereof), the resulting hybrid protein will have
one activity (only that of TBP1) or a combination of activities
(that of TBP1 with that of hCG). In this latter case the hybrid
protein can be used, for example, in the combined treatment of
Kaposi's sarcoma and metabolic wasting in AIDS.
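
The two-chain architecture defined above can be sketched as a
small data model. This is only an illustration of the stated
constraints (each chain carries at least one sequence (a) and
one subunit (b), and the two subunits (b) must differ). The
class and field names are hypothetical and the string labels
stand in for real sequences.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Chain:
    a_domains: Tuple[str, ...]    # sequence(s) (a): receptor domain, ligand, or fragment
    b_subunit: str                # sequence (b): subunit of a heterodimeric hormone
    linker: Optional[str] = None  # peptide linker, or None if (a) and (b) are bonded directly


@dataclass(frozen=True)
class HybridProtein:
    chain1: Chain
    chain2: Chain

    def __post_init__(self) -> None:
        # Validate the two constraints stated in the text.
        for chain in (self.chain1, self.chain2):
            if not chain.a_domains:
                raise ValueError("each chain needs at least one sequence (a)")
        if self.chain1.b_subunit == self.chain2.b_subunit:
            raise ValueError("the two subunits (b) must be different")

    def distinct_a(self) -> set:
        """Return the distinct sequences (a) carried by the dimer."""
        return set(self.chain1.a_domains) | set(self.chain2.a_domains)


# The TBP1/hCG embodiment: the same (a) on both chains, different (b).
tbp_hcg = HybridProtein(
    Chain(("TBP1(20-161)",), "hCG alpha"),
    Chain(("TBP1(20-161)",), "hCG beta"),
)
print(len(tbp_hcg.distinct_a()))
```

Counting distinct sequences (a) is only one assumed way of
reading "monofunctional" versus "bifunctional"; as the text
notes, the activity retained by (b) also matters.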

In a further embodiment of the invention, one or more
covalent bonds between the two subunits (b) are added to
enhance the stability of the resulting hybrid protein. This
can be done, e.g., by adding one or more non-native interchain
disulfide bonds. The sites for these cross-links can be
deduced from the known structures of the heterodimeric
hormones. For example, a suitable site in hCG could be to
place cysteine residues at α subunit residue Lys45 and β
subunit residue Glu21, replacing a salt bridge (non-covalent
bond) with a disulfide bond (covalent bond). Another object of
the present invention is PEGylated or other chemically
modified forms of the hybrid proteins.

A further object of the present invention is a DNA
molecule comprising the DNA sequence coding for the above
hybrid protein, as well as nucleotide sequences substantially
the same. "Nucleotide sequences substantially the same"
includes all other nucleic acid sequences which, by virtue of
the degeneracy of the genetic code, also code for the given
amino acid sequence.
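
To give a sense of how many nucleotide sequences the degeneracy
of the genetic code admits, the sketch below counts the DNA
sequences that encode a given peptide under the standard
genetic code. The function name is hypothetical; the example
peptide is the six-residue linker Ala-Gly-Ala-Ala-Pro-Gly
listed in this document as SEQ ID NO: 9.

```python
# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid codes; the counts sum to the 61 sense codons).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_coding_sequences(peptide: str) -> int:
    """Count the distinct DNA sequences encoding `peptide` (stop codon excluded)."""
    total = 1
    for residue in peptide:
        total *= CODONS_PER_RESIDUE[residue]
    return total


print(count_coding_sequences("AGAAPG"))  # 4 ** 6 = 4096
```

Even this short linker has 4096 synonymous coding sequences, so
the number for a full hybrid protein is far larger.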

WO 97/30161
PCT/US97/02315

For the production of the hybrid protein of the
invention, the DNA sequence (a) is obtained from existing
clones, as is (b). The DNA sequence coding for the desired
sequence (a) is ligated with the DNA sequence coding for the
desired sequence (b). Two of these fused products are inserted
and ligated into a suitable plasmid or each into a different
plasmid. Once formed, the expression vector, or the two
expression vectors, is introduced into a suitable host cell,
which then expresses the vector(s) to yield the hybrid protein
of the invention as defined above.

The preferred method for preparing the hybrid of the
invention is by way of PCR technology using oligonucleotides
specific for the desired sequences to be copied from the clones
encoding sequences (a) and (b).

Expression of any of the recombinant proteins of the
invention as mentioned herein can be effected in eukaryotic
cells (e.g., yeasts, insect or mammalian cells) or prokaryotic
cells, using the appropriate expression vectors. Any method
known in the art can be employed.

For example, the DNA molecules coding for the proteins
obtained by any of the above methods are inserted into
appropriately constructed expression vectors by techniques well
known in the art (see Sambrook et al., 1989). Double-stranded
cDNA is linked to plasmid vectors by homopolymeric tailing or
by restriction linking involving the use of synthetic DNA
linkers or blunt-ended ligation techniques: DNA ligases are
used to ligate the DNA molecules and undesirable joining is
avoided by treatment with alkaline phosphatase.

In order to be capable of expressing the desired
protein, an expression vector should also comprise specific
nucleotide sequences containing transcriptional and
translational regulatory information linked to the DNA coding
for the desired protein in such a way as to permit gene
expression and production of the protein. First, in order for
the gene to be transcribed, it must be preceded by a promoter
recognizable by RNA polymerase, to which the polymerase binds
and thus initiates the transcription process. There are a
variety of such promoters in use, which work with different
efficiencies (strong and weak promoters).

For eukaryotic hosts, different transcriptional and
translational regulatory sequences may be employed, depending
on the nature of the host. They may be derived from viral
sources, such as adenovirus, bovine papilloma virus, Simian
virus or the like, where the regulatory signals are associated
with a particular gene which has a high level of expression.
Examples are the TK promoter of the Herpes virus, the SV40
early promoter, the yeast gal4 gene promoter, etc.
Transcriptional initiation regulatory signals may be selected
which allow for repression and activation, so that expression
of the genes can be modulated.

The DNA molecule comprising the nucleotide sequence
coding for the hybrid protein of the invention is inserted into
one or more vectors, having the operably linked transcriptional
and translational regulatory signals, which are capable of
integrating the desired gene sequences into the host cell. The
cells which have been stably transformed by the introduced DNA
can be selected by also introducing one or more markers which
allow for selection of host cells which contain the expression
vector. The marker may also provide for prototrophy to an
auxotrophic host, biocide resistance, e.g., antibiotics, or
heavy metals such as copper, or the like. The selectable
marker gene can either be directly linked to the DNA gene
sequences to be expressed, or introduced into the same cell by
co-transfection. Additional elements may also be needed for
optimal synthesis of proteins of the invention.

Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient 
cells that contain the vector may be recognized and selected 
from those recipient cells which do not contain the vector; the 
number of copies of the vector which are desired in a 
particular host; and whether it is desirable to be able to 
"shuttle" the vector between host cells of different species. 

Once the vector(s) or DNA sequence containing the
construct(s) has been prepared for expression, the DNA
construct(s) may be introduced into an appropriate host cell by
any of a variety of suitable means: transformation,
transfection, conjugation, protoplast fusion, electroporation,
calcium phosphate-precipitation, direct microinjection, etc.

Host cells may be either prokaryotic or eukaryotic.
Preferred are eukaryotic hosts, e.g., mammalian cells, such as
human, monkey, mouse, and Chinese hamster ovary (CHO) cells,
because they provide post-translational modifications to
protein molecules, including correct folding or glycosylation
at correct sites. Also, yeast cells can carry out
post-translational peptide modifications including
glycosylation. A number of recombinant DNA strategies exist
which utilize strong promoter sequences and high copy number of
plasmids which can be utilized for production of the desired
proteins in yeast. Yeast recognizes leader sequences on cloned
mammalian gene products and secretes peptides bearing leader
sequences (i.e., pre-peptides).

After the introduction of the vector(s), the host
cells are grown in a selective medium, which selects for the
growth of vector-containing cells. Expression of the cloned
gene sequence(s) results in the production of the desired
proteins.

Purification of the recombinant proteins is carried
out by any one of the methods known for this purpose, i.e., any
conventional procedure involving extraction, precipitation,
chromatography, electrophoresis, or the like. A further
purification procedure that may be used in preference for
purifying the protein of the invention is affinity
chromatography using monoclonal antibodies which bind the
target protein and which are produced and immobilized on a gel
matrix contained within a column. Impure preparations
containing the recombinant protein are passed through the
column. The protein will be bound to the column by the
specific antibody while the impurities will pass through.
After washing, the protein is eluted from the gel by a change
in pH or ionic strength.

The term "hybrid protein", as used herein,
generically refers to a protein which contains two or more
different proteins or fragments thereof.

As used herein, "fusion protein" refers to a hybrid
protein, which consists of two or more proteins, or fragments
thereof, linked together covalently.

The term "aggregation", as used herein, means the
formation of strong specific non-covalent interactions between
two polypeptide chains forming a complex, such as those
existing between the α and β subunits of a heterodimeric
hormone (such as FSH, LH, hCG or TSH).

The terms "ligand" or "ligand protein", as used
herein, refer to a molecule, other than an antibody or an
immunoglobulin, capable of being bound by the ligand-binding
domain of a receptor; such molecule may occur in nature, or may
be chemically modified or chemically synthesised.

The term "ligand-binding domain", as used herein,
refers to a portion of the receptor that is involved in binding
a ligand and is generally a portion or essentially all of the
extracellular domain.

The term "receptor", as used herein, refers to a
membrane protein, whose binding with the respective ligand
triggers secondary cellular responses that result in the
activation or inhibition of intracellular processes.

In a further aspect, the present invention provides
the use of the hybrid protein as a medicament. The medicament
is preferably presented in the form of a pharmaceutical
composition comprising the protein of the invention together
with one or more pharmaceutically acceptable carriers and/or
excipients. Such pharmaceutical compositions represent yet a
further aspect of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

The invention will be better understood by reference 
to the appended drawings, in which: 

Figures 1(a) and 1(b) show the TBP(20-161)-hCGα and
TBP(20-161)-hCGβ constructs, respectively, and the
corresponding sequences (SEQ ID NOS:1-4).

Figures 2(a) and 2(b) show the TBP(20-190)-hCGα and
TBP(20-190)-hCGβ constructs, respectively, and the
corresponding sequences (SEQ ID NOS:5-8).

Figure 3 is a schematic summary of the constructs of
Figures 1 and 2 showing p55 TNFR1, TBP1 and TBP1 fusion
constructs. The linker sequences shown on the last two lines
are SEQ ID NO: 9 (Ala-Gly-Ala-Ala-Pro-Gly) and SEQ ID NO: 10
(Ala-Gly-Ala-Gly).

Figure 4 is a graph illustrating the dose dependent
protective effect of CHO cell expressed TBP-hCG(20-190) on
TNFα-induced cytotoxicity on BT-20 cells and various controls.

Figure 5 is a graph illustrating the dose dependent
protective effect of COS cell expressed TBP-hCG(20-190) on
TNFα-induced cytotoxicity on BT-20 cells and various controls.

Figure 6 is a graph illustrating the dose dependent
protective effect of affinity purified CHO cell expressed
TBP-hCG(20-161) on TNFα-induced cytotoxicity on BT-20 cells and
various controls.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The invention will now be described by means of the
following Examples, which should not be construed as in any way
limiting the present invention.



Materials and Methods 

Cell lines used in this study were obtained from the
American Type Culture Collection (ATCC), Rockville, Maryland,
unless otherwise specified. The CHO-DUKX cell line was
obtained from L. Chasin at Columbia University through D.
Houseman at MIT (39). The CHO-DUKX cells, which lack a
functional gene for dihydrofolate reductase, were routinely
maintained in complete α-plus Modified Eagle's Medium (α(+)MEM)
supplemented with 10% fetal bovine serum (FBS). The COS-7
cells were routinely maintained in Dulbecco's Modified Eagle's
Medium (DMEM) supplemented with 10% FBS. Unless specified
otherwise, cells were split to maintain them in log phase of
growth, and culture reagents were obtained from GIBCO (Grand
Island, New York).






1. Assembly of the genetic constructs encoding the
hybrid proteins

The numbering assignments for the p55 TNF receptor
are based on the cloning paper from Wallach (40), while the
numbering assignments for the hCG subunits are based on the
numbering assignments from the Fiddes cloning papers (41,42).
The designation TBP, or TNF binding protein, refers to the
extracellular domain portions of the TNF receptors capable of
binding TNF. In these Examples, the DNA constructs will be
named as TBP-hybrid proteins, with the partner and region of
TBP indicated in the construct nomenclature. All of the
TBP-hCG constructs contain the human growth hormone (hGH)
signal peptide in place of the native p55 signal sequence. In
addition, the hGH signal peptide has been placed so that it
immediately precedes TBP residue Asp20, which is anticipated to
make this the first residue in the mature, secreted protein.
These modifications are not essential to the basic concept of
using hCG as a partner of the hybrid protein.

The DNAs encoding the hybrid proteins were
constructed using PCR methodology (43).

a. TBP1(20-161)-hCG

The initial TBP-hCG construct was engineered to
contain the ligand binding domain from the extracellular region
of the p55 TNF receptor (from Asp20 inclusive of residue
Cys161) fused through a short linker to the hCG α and β
subunits (starting at residues αCys7 or βPro7, respectively).
This construct, hereafter referred to as TBP1(20-161)-hCG, is a
heterodimer of two modified hCG subunits, TBP1(20-161)-hCGα and
TBP1(20-161)-hCGβ.

The oligodeoxynucleotide primers used for the
TBP1(20-161)-hCGα construct were:

primer 1(αβ) TTT TCT CGA GAT GGC TAC AGG TAA GCG
CCC (SEQ ID NO: 11)

primer 2(α) ACC TGG GGC AGC ACC GGC ACA GGA GAC ACA
CTC GTT TTC (SEQ ID NO: 12)

primer 3(α) TGT GCC GGT GCT GCC CCA GGT TGC CCA GAA
TGC ACG CTA CAG (SEQ ID NO: 13)

primer 4(α) TTT TGG ATC CTT AAG ATT TGT GAT AAT AAC
AAG TAC (SEQ ID NO: 14)

These and all of the other primers described in these
Examples were synthesized on an Applied Biosystems Model 392
DNA synthesis machine (ABI, Foster City, California), using
phosphoramidite chemistry.

Since both of the TBP-hCG subunit constructs have the
same 5'-end (i.e., the 5'-end of the hGH/TBP construct), primer
1(αβ) was used for both TBP-hCG subunit constructs. The
other primers used for the TBP1(20-161)-hCGβ construct were:

primer 2(β) CCG TGG ACC AGC ACC AGC ACA GGA GAC
ACA CTC GTT TTC (SEQ ID NO: 15)

primer 3(β) TGT GCT GGT GCT GGT CCA CGG TGC CGC
CCC ATC AAT (SEQ ID NO: 16)

primer 4(β) TTT TGG ATC CTT ATT GTG GGA GGA TCG
GGG TG (SEQ ID NO: 17)

Primers 2(α) and 3(α) are reverse complements, and
cover both the 3'-end of the coding region for the p55
extracellular domain, and the 5'-end of the hCG α subunit.
Similarly, primers 2(β) and 3(β) are also reverse
complements, and cover both the 3'-end of the coding region for
the p55 extracellular domain, and the 5'-end of the hCG β
subunit.
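
Because primers 2 and 3 of each pair differ in length, they can
be complementary only over part of their sequence. The
following sketch, using the primer sequences as listed above,
computes the stretch over which the reverse complement of
primer 2 coincides with the 5'-end of primer 3. The function
names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def junction_overlap(reverse_primer: str, forward_primer: str) -> str:
    """Longest suffix of rc(reverse_primer) that is a prefix of forward_primer."""
    top_strand = reverse_complement(reverse_primer)
    for k in range(min(len(top_strand), len(forward_primer)), 0, -1):
        if top_strand[-k:] == forward_primer[:k]:
            return forward_primer[:k]
    return ""


# Primer sequences copied from the listing above (spaces removed below).
PRIMERS = {
    "2(alpha)": "ACC TGG GGC AGC ACC GGC ACA GGA GAC ACA CTC GTT TTC",
    "3(alpha)": "TGT GCC GGT GCT GCC CCA GGT TGC CCA GAA TGC ACG CTA CAG",
    "2(beta)": "CCG TGG ACC AGC ACC AGC ACA GGA GAC ACA CTC GTT TTC",
    "3(beta)": "TGT GCT GGT GCT GGT CCA CGG TGC CGC CCC ATC AAT",
}
PRIMERS = {name: seq.replace(" ", "") for name, seq in PRIMERS.items()}

for subunit in ("alpha", "beta"):
    overlap = junction_overlap(PRIMERS[f"2({subunit})"], PRIMERS[f"3({subunit})"])
    print(subunit, len(overlap), overlap)
```

For both pairs the shared region is 21 nucleotides long; this
is the segment through which the two first-round products can
anneal to each other.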

Two PCR reactions were run for each of the two
TBP-hCG subunit constructs. The first used primers 1(αβ) and 2
(α or β), and used as the template a plasmid encoding soluble
p55 residues 20-180 preceded by the hGH signal peptide (plasmid
pCMVhGHspcDNA.pA4). The second used primers 3 (α or β) and 4
(α or β), and used as the template either plasmid pSVL-hCGα
or pSVL-hCGβ (44). The PCR was performed using Vent(TM)
polymerase from New England Biolabs (Beverly, Massachusetts) in
accordance with the manufacturer's recommendations, using for
each reaction 25 cycles and the following conditions:

100 ng of template DNA

1 ng of each primer 

2U of Vent(TM) polymerase (New England Biolabs)
denaturation at 99°C for 30 seconds
annealing at:
59°C for 30 seconds for primers 1(αβ) and 2(α)
59°C for 30 seconds for primers 3(α) and 4(α)
57°C for 30 seconds for primers 1(αβ) and 2(β)
63°C for 30 seconds for primers 3(β) and 4(β)
extension at 75°C for 75 seconds.
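
The cycling conditions listed above can be written down as a
small data structure, which makes the per-primer-pair annealing
temperatures explicit. This is a sketch only: the class and
field names are hypothetical, and the computed time covers the
programmed hold steps, not the ramping between temperatures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CycleProgram:
    anneal_c: int          # annealing temperature, degrees C
    cycles: int = 25
    denature_c: int = 99   # denaturation temperature, degrees C
    denature_s: int = 30
    anneal_s: int = 30
    extend_c: int = 75     # extension temperature, degrees C
    extend_s: int = 75

    def hold_seconds(self) -> int:
        """Total programmed hold time, excluding temperature ramps."""
        return self.cycles * (self.denature_s + self.anneal_s + self.extend_s)


# First-round reactions, keyed by primer pair (names as used in the text).
FIRST_ROUND = {
    ("1(alpha-beta)", "2(alpha)"): CycleProgram(anneal_c=59),
    ("3(alpha)", "4(alpha)"): CycleProgram(anneal_c=59),
    ("1(alpha-beta)", "2(beta)"): CycleProgram(anneal_c=57),
    ("3(beta)", "4(beta)"): CycleProgram(anneal_c=63),
}

for pair, program in FIRST_ROUND.items():
    print(pair, f"anneal {program.anneal_c} C,", program.hold_seconds(), "s of holds")
```

Each 25-cycle reaction amounts to 3375 seconds of programmed
holds, a little under an hour before ramp times are added.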

The PCR products were confirmed to be the expected
size by electrophoresis in a 2% agarose gel and ethidium
bromide staining. The fragments were then purified by passage
over a Wizard column (Promega) in accordance with the column
manufacturer's recommendations.

The final coding sequence for TBP1(20-161)-hCGα was
assembled by fusion PCR using primer 1(αβ) and primer 4(α),
and using as template the purified products from the p55 and
hCG α fragments obtained from the first PCR reactions. First
the two templates, which due to the overlap between primers
2(α) and 3(α) could be denatured and annealed together, were
passed through 10 cycles of PCR in the absence of any added
primers. The conditions for these cycles were essentially the
same as those used earlier, except that the annealing was done
at 67°C and the extension was performed for 2 minutes. At the
end of these 10 cycles, primers 1(αβ) and 4(α) were added,
and another 10 cycles were performed. The conditions for this
final set of reactions were the same as used earlier, except
that an annealing temperature of 59°C was used, and the
extension was performed for 75 seconds.
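
The primerless annealing and extension step can be pictured as
joining two fragments across their shared end. The sketch
below does this in silico for the α junction only, taking the
3'-end of the p55 fragment as the reverse complement of primer
2(α) and the 5'-end of the hCG α fragment as primer 3(α). The
function names are hypothetical, and the reading frame is
assumed from the codon grouping used in the primer listing.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons of `seq` in frame 1 (one-letter codes)."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def fuse(upstream: str, downstream: str) -> str:
    """Join two top-strand fragments across their longest end overlap."""
    for k in range(min(len(upstream), len(downstream)), 0, -1):
        if upstream.endswith(downstream[:k]):
            return upstream + downstream[k:]
    raise ValueError("fragments share no overlap")


primer_2_alpha = "ACC TGG GGC AGC ACC GGC ACA GGA GAC ACA CTC GTT TTC".replace(" ", "")
primer_3_alpha = "TGT GCC GGT GCT GCC CCA GGT TGC CCA GAA TGC ACG CTA CAG".replace(" ", "")

p55_fragment_end = reverse_complement(primer_2_alpha)  # top strand, 3'-end of p55 product
hcg_fragment_start = primer_3_alpha                    # top strand, 5'-end of hCG alpha product

junction = fuse(p55_fragment_end, hcg_fragment_start)
print(len(junction), translate(junction))
```

Under these assumptions the 60-nucleotide junction translates
to ENECVSCAGAAPGCPECTLQ, i.e. a cysteine followed by the
Ala-Gly-Ala-Ala-Pro-Gly linker (SEQ ID NO: 9) and then a
sequence beginning with cysteine.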

Analysis of the products of this reaction by
electrophoresis in a 1% agarose gel confirmed that the expected
fragment of about 1100 bp was obtained. The reaction was passed
over a Wizard column to purify the fragment, which was then
digested with XbaI and BamHI and re-purified in a 0.7%
low-melting point agarose gel. The purified fragment was
subcloned into plasmid pSVL (Pharmacia), which had first been
digested with XbaI and BamHI and gel purified on a 0.8%
low-melting point agarose gel. Following ligation with T4
ligase, the mixture was used to transform AG1 E. coli and then
plated onto LB/ampicillin plates for overnight culture at 37°C.
Plasmid DNAs from ampicillin-resistant colonies were analyzed
by digestion with XhoI and BamHI to confirm the presence of the
insert (which is excised in this digest). Six clones were
found to contain inserts, and one (clone 7) was selected for
further advancement and designated pSVLTBPhCGα (containing
TBP1(20-161)-hCGα). Dideoxy DNA sequencing (using Sequenase™,
U.S. Biochemicals, Cleveland, Ohio) of the insert in this
vector confirmed that the construct was correct, and that no
undesired changes had been introduced.

The final coding sequence for TBP1(20-161)-hCGβ was
assembled in a manner similar to that described for
TBP1(20-161)-hCGα using fusion PCR and primers 1(αβ) and 4(β),
and using as template the purified products from the p55 and
hCG β fragments obtained from the first PCR reactions. The
resulting pSVL plasmid containing the insert of interest was
designated pSVLTBPhCGβ.

b. TBP(20-190)-hCG

A second set of TBP-hCG proteins was prepared by
modification of the TBP(20-161)-hCG constructs to produce an
analog containing TBP spanning from Asp20 to Thr190, in place
of the 20-161 region in the initial analog. This was done by
replacing the fragment between the BglII and XbaI sites in
plasmid pSVLTBPhCGα with a PCR fragment containing the change.
This PCR fragment was generated using fusion PCR. The primers
were:

primer 1    TTT TAG ATC TCT TCT TGC ACA GTG GAC
            (SEQ ID NO:18)
primer 2    TGT GGT GCC TGA GTC CTC AGT (SEQ ID NO:19)
primer 3    ACT GAG GAC TCA GGC ACC ACA GCC GGT GCT
            GCC CCA GGT TG (SEQ ID NO:20)
primer 4    TTT TTC TAG AGA AGC AGC AGC AGC CCA TG
            (SEQ ID NO:21)

Primers 1 and 2 were used to generate the sequence
coding the additional p55 residues from 161-190. The PCR
reaction was performed essentially as described earlier, using
1 µg of each primer and pUC-p55 as template. Similarly,
primers 3 and 4 were used to generate by PCR the linker between
the 3'-end of the TBP-coding region and the 5'-end of the hCG
α subunit coding region, using as a template plasmid
pSVLTBPhCGα. Products from these PCR reactions were confirmed
to be the correct size (about 296 bp and 121 bp, respectively)
by polyacrylamide gel electrophoresis (PAGE) on an 8% gel, and
were then purified using a Wizard column. The design of
primers 2 and 3 was such that they contained a region of
overlap, so that the two PCR products (from primers 1 and 2,
and from primers 3 and 4) could be annealed for fusion PCR with
primers 1 and 4. Subsequent to the fusion reaction, the
desired product of about 400 bp was confirmed and purified
using a 1.5% agarose gel and a Wizard column. This DNA was
then digested with BglII and XbaI, and ligated with
BglII/XbaI-digested pSVLTBPhCGα. The presence of an insert in
plasmids isolated from transformed AG1 E. coli was confirmed by
digestion with BglII and XbaI. The new construct was
designated pSVLTBP(20-190)-hCGα.
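
The overlap designed into primers 2 and 3 can be checked directly from the primer sequences listed above. The following is a minimal sketch, not part of the original protocol: the function and variable names are illustrative, and the size estimate assumes that the two first-round products overlap only by the primer-encoded region.

```python
# Illustrative check of the primer 2 / primer 3 overlap used for fusion PCR.
# Sequences are copied from SEQ ID NOs: 19 and 20; all names are hypothetical.

COMPLEMENT = str.maketrans("ACGT", "TGCA")

PRIMER_2 = "TGTGGTGCCTGAGTCCTCAGT"                      # reverse primer, fragment 1
PRIMER_3 = "ACTGAGGACTCAGGCACCACAGCCGGTGCTGCCCCAGGTTG"  # forward primer, fragment 2


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def common_prefix_length(a, b):
    """Count how many leading characters two strings share."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# The top strand of fragment 1 ends with the reverse complement of primer 2;
# the top strand of fragment 2 begins with primer 3.
overlap = common_prefix_length(reverse_complement(PRIMER_2), PRIMER_3)

# Reported first-round product sizes (bp); the shared region is counted once.
fusion_size = 296 + 121 - overlap

print(overlap, fusion_size)  # prints: 21 396
```

Under these assumptions the two products share 21 bases, and the predicted fusion product of 396 bp agrees with the "about 400 bp" fragment reported above.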

Similarly, plasmid pSVLTBPhCGβ was modified by
substitution of the BglII-XcmI fragment. However, this was
done by subcloning of a single PCR product, rather than with a
fusion PCR product. Primers 1 and 2b (see below) were used
with pUC-p55 as the template.

primer 2b TTT TCC ACA GCC AGG GTG GCA TTG ATG GGG 
CGG CAC CGT GGA CCA GCA CCA GCT GTG GTG 
CCT GAG TCC TCA GTG (SEQ ID NO: 22) 

The resulting PCR product (about 337 bp) was confirmed
and purified as described above, digested with BglII and XcmI,
and then ligated into BglII/XbaI-digested pSVLTBPhCGβ. The
presence of an insert in plasmids isolated from transformed
AG1 E. coli was confirmed by digestion with BglII and XcmI.
The new construct was designated pSVLTBP(20-190)-hCGβ.
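
The restriction sites used for this subcloning step can be located in primers 1 and 2b with a short script. This is a sketch only: the recognition patterns for BglII (AGATCT) and XcmI (CCA, nine arbitrary bases, TGG) are general knowledge about these enzymes rather than statements from this document, and the helper name is hypothetical.

```python
import re

# Primer sequences copied from SEQ ID NOs: 18 and 22.
PRIMER_1 = "TTTTAGATCTCTTCTTGCACAGTGGAC"
PRIMER_2B = (
    "TTTTCCACAGCCAGGGTGGCATTGATGGGG"
    "CGGCACCGTGGACCAGCACCAGCTGTGGTG"
    "CCTGAGTCCTCAGTG"
)

# Assumed recognition patterns (not given in the text). Both patterns read the
# same on either strand, so searching the primer as written is sufficient.
SITES = {
    "BglII": "AGATCT",
    "XcmI": "CCA[ACGT]{9}TGG",
}


def find_site(enzyme, seq):
    """Return the 0-based start of the first site for `enzyme`, or -1."""
    match = re.search(SITES[enzyme], seq)
    return match.start() if match else -1


bglii_pos = find_site("BglII", PRIMER_1)
xcmi_pos = find_site("XcmI", PRIMER_2B)

print(bglii_pos, xcmi_pos)  # prints: 4 4
```

Both sites sit four bases in from the 5' end of their primer, which is consistent with the PCR product being cut with BglII and XcmI before ligation.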

The new constructs were subsequently confirmed by DNA 
sequencing. 

In addition to producing these new pSVL-based
plasmids, these constructs were also subcloned into other
expression vectors likely to be more suitable for stable
expression in CHO, particularly vector Dα, previously described
as plasmid CLH3AXSV2DHFR (45). This was accomplished by
converting a BamHI site flanking the inserts in the pSVL-based
vectors to an XhoI site, and then excising the insert with XhoI
and cloning it into XhoI-digested Dα.

2. Transient and stable expression of the hybrid
proteins

Transfections of COS-7 cells (ATCC CRL 1651, ref.
46) for transient expression of the TBP-hCG hybrid proteins
were performed using electroporation (47). Exponentially
growing COS-7 cells were removed by trypsinization, collected
by gentle centrifugation (800 rpm, 4 minutes), washed with cold
phosphate buffered saline (PBS), pH 7.3-7.4, and then
repelleted by centrifugation. Cells were resuspended at a
concentration of 5×10⁶ cells per 400 µl cold PBS and mixed with
10 µg of plasmid DNA in a prechilled 2 mm gap electroporation
cuvette. For cotransfections, 5 µg of each plasmid were used.
The cuvette and cells were chilled on ice for a further 10
minutes, and then subjected to electroporation using a BTX
Model 600 instrument and conditions of 125 V, 950 µF and R=8.
Afterward the cells were set to cool on ice for 10 minutes,
transferred to a 15 ml conical tube containing 9.5 ml complete
medium (Dulbecco's modified Eagle's medium (DMEM) supplemented
with 10% fetal bovine serum (FBS) and 1% L-glutamine) at room
temperature, and left at room temperature for 5 minutes. After
gentle mixing in the 15 ml tube, the entire contents were seeded
onto two P100 plates and placed into a 37°C, 5% CO₂ incubator.
After 18 hours the media was changed, and in some cases the new
media contained only 1% or 0% FBS. After another 72 hours, the
conditioned media was harvested, centrifuged to remove cells,
and then stored frozen at -70°C.

Transfections of CHO-DUKX (CHO) cells for transient
or stable expression were performed using calcium phosphate
precipitation of DNA. Twenty-four hours prior to the
transfection, exponentially growing CHO cells were plated onto
100 mm culture plates at a density of 7.5×10⁵ cells per plate.
On the day of the transfection, 10 µg of plasmid DNA was
brought to 0.5 ml in transfection buffer (see below), 31 µl of
2 M CaCl₂ were added, the DNA-CaCl₂ solution was mixed by
vortexing, and left to stand at room temperature for 45
minutes. After this the media was aspirated from the plates,
the DNA was added to the cells using a sterile plastic pipette,
and the cells were left at room temperature for 20 minutes. At
the end of this period, 5 ml of complete α(+)MEM containing 10%
FBS was added to the plates, which were incubated at 37°C for
4-6 hours. The media was then aspirated off the plates, and
the cells were subjected to a glycerol shock by incubating them
with a solution of 15% glycerol in transfection buffer at 37°C
for 3.5 minutes. After removal of the glycerol solution, the
cells were washed twice with PBS, refed with 10 ml complete
α(+)MEM, 10% FBS, and returned to the 37°C incubator. For
stable transfections, after 48 hours the cells were split 1:10
and fed with selection medium (complete α-minus MEM (lacking
nucleosides), 10% dialyzed FBS, and 0.02 µM methotrexate).
Non-transfected (non-resistant) cells were typically eliminated
in 3-4 weeks, leaving a population of transfected,
methotrexate-resistant cells.
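
For orientation, the final CaCl₂ and DNA concentrations in the precipitation mix follow from the volumes given above. The calculation below is a rough sketch that assumes the volumes are simply additive (0.5 ml of DNA in buffer plus 31 µl of 2 M CaCl₂); the function name is illustrative.

```python
# Illustrative dilution arithmetic for the calcium phosphate precipitation mix.
# Assumption: volumes are additive and the DNA solution is exactly 500 µl.

def final_molarity(stock_molar, stock_volume_ul, base_volume_ul):
    """Molarity of a stock after it is added to a base volume."""
    total_volume_ul = stock_volume_ul + base_volume_ul
    return stock_molar * stock_volume_ul / total_volume_ul


DNA_UG = 10.0           # plasmid DNA per transfection
BUFFER_VOLUME_UL = 500.0
CACL2_STOCK_M = 2.0
CACL2_VOLUME_UL = 31.0

cacl2_mM = 1000.0 * final_molarity(CACL2_STOCK_M, CACL2_VOLUME_UL, BUFFER_VOLUME_UL)
dna_ug_per_ml = DNA_UG / ((BUFFER_VOLUME_UL + CACL2_VOLUME_UL) / 1000.0)

print(f"CaCl2: {cacl2_mM:.1f} mM, DNA: {dna_ug_per_ml:.1f} ug/ml")
# prints: CaCl2: 116.8 mM, DNA: 18.8 ug/ml
```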

3. Quantitation of expression

Secretion of the hybrid proteins by transfected cells
was assessed using a commercial assay kit for soluble p55 (R&D
Systems; Minneapolis, Minnesota) in accordance with the
manufacturer's instructions. This assay also provides an
estimate of the hybrid protein levels in conditioned and
processed media, which served as the basis for selecting doses
to be used in the bioassay.

4. Assessment of heterodimer formation 

To assess the ability of the TBP-hCG subunit fusions
to combine and form heterodimers, a sandwich immunoassay using
antibodies to the hCG subunits was performed. In this assay, a
monoclonal antibody to the hCG β subunit is coated onto
microtiter plates and used for analyte capture. The primary
detection antibody is a goat polyclonal raised against the
human TSH α subunit (#082422G - Biodesign International;
Kennebunkport, Maine), which is in turn detected using a
horseradish peroxidase conjugated rabbit anti-goat polyclonal
antibody (Cappel; Durham, North Carolina).

Several different anti-hCG β subunit antibodies
were used in this work, all of which show no detectable
cross-reactivity with the free α subunit. One of these
antibodies (3/6) is used in the commercially available
MAIAclone hCG assay kit (Biodata; Rome, Italy).

High-protein binding microtiter plates (Costar #3590)
were coated with capture antibody by incubation (2 hours at
37°C) with 100 µl/well of a 5 µg/ml solution of antibody in
coating buffer (PBS, pH 7.4, 0.1 mM Ca²⁺, 0.1 mM Mg²⁺). After
washing once with wash solution (PBS, pH 7.4 + 0.1% Tween 20)
the plate is blocked by completely filling the wells (~400
µl/well) with blocking solution (3% bovine serum albumin (BSA;
fraction V - A-4503 Sigma) in PBS, pH 7.4) and incubating for
one hour at 37°C or overnight at 4°C. The plate is then washed
twice with wash solution, and the reference and experimental
samples, diluted in diluent (5 mg/ml BSA in PBS, pH 7.4) to
yield a 100 µl volume, are added. After incubating the samples
and the plate for two hours at 37°C, the plate is again washed
twice with wash solution. The primary detection antibody,
diluted 1:5000 in diluent, is added (100 µl/well) and incubated
for one hour at 37°C. The secondary detection antibody (HRP
conjugated rabbit anti-goat Ig), diluted 1:5000 in diluent, is
added (100 µl/well) and after incubation for one hour at 37°C,
the plate is washed three times with wash solution. One
hundred µl of TMB substrate solution (Kirkegaard and Perry
Laboratories) is added, the plate is incubated 20 minutes in
the dark at room temperature, and then the enzymatic reaction
is stopped by addition of 50 µl/well 0.3 M H₂SO₄. The plate is
then analyzed using a microtiter plate reader set for a
wavelength of 450 nm.

5. Partial purification 

To better quantitate the activities of these hybrid
proteins, TBP-hCG hybrid proteins were partially purified by
immunoaffinity chromatography. The antibody used was a
monoclonal commercially available from R&D Systems (MAB #225).
The column was CNBr-activated sepharose, charged with the
antibody by following the manufacturer's (Pharmacia)
instructions.

Conditioned media was collected from confluent T-175
flasks of each line using daily harvests of 50 ml SFMII media
(GIBCO), five harvests for each line. The collections were
subjected to centrifugation (1000 RPM) to remove cellular
debris. The material was then assayed for TBP content using
the commercial immunoassay and concentrated (Centricon units by
Amicon; Beverly, Massachusetts) so that the apparent TBP
concentration was about 50 ng/ml.

Ten ml of the concentrated TBP-hCG (sample #18873)
was brought to approximately 1 M NaCl by addition of NaCl and
adjustment of the solution to a conductivity of approximately
85 mS/cm. This was passed through a 0.5 ml anti-TBP
immunoaffinity column. The flow-through was collected and run
through the column a second time. After this the column was
washed with 1 M NaCl in PBS. The bound TBP(20-161)-hCG was
collected after elution with 50 mM citric acid (pH 2.5). The
eluate (approximately 7 ml) was concentrated by filtration
using Amicon Centricon-10 units in accordance with the
manufacturer's (Amicon) instructions, to a volume of
approximately 200 µl. Approximately 800 µl of PBS was added to
bring the sample volume to 1 ml, which was stored at 4°C until
tested by bioassay.
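
The volumes above imply a simple mass balance for the purification step. The sketch below is illustrative only: it treats the approximate figures in the text as exact and assumes complete recovery from the column, which the document does not claim, so the resulting concentration is an upper bound rather than a measured value.

```python
# Illustrative mass balance for the immunoaffinity step.
# Assumption (hypothetical): 100% recovery, giving an upper bound only.

LOAD_VOLUME_ML = 10.0        # concentrated conditioned media applied
LOAD_CONC_NG_PER_ML = 50.0   # apparent TBP concentration before loading
ELUATE_VOLUME_UL = 7000.0    # citric acid eluate
RETENTATE_VOLUME_UL = 200.0  # volume after Centricon concentration
PBS_ADDED_UL = 800.0         # PBS added to reach the final volume

loaded_ng = LOAD_VOLUME_ML * LOAD_CONC_NG_PER_ML
concentration_factor = ELUATE_VOLUME_UL / RETENTATE_VOLUME_UL
final_volume_ml = (RETENTATE_VOLUME_UL + PBS_ADDED_UL) / 1000.0
max_final_conc_ng_per_ml = loaded_ng / final_volume_ml

print(loaded_ng, concentration_factor, max_final_conc_ng_per_ml)
# prints: 500.0 35.0 500.0
```

Any loss on the column or during filtration lowers the final concentration below this bound, which is why the samples were quantified again by ELISA before dosing.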

6. Assessment of anti-TNF activity

Numerous in vitro TNF-induced cytotoxicity assays
have been described for evaluating analogs of soluble TNF
receptors. We utilized an assay employing a human breast
carcinoma cell line, BT-20 cells (ATCC HTB 19). The use of
these cells as the basis for a TNF bioassay has been described
previously (48). These cells are cultured at 37°C in RPMI 1640
media supplemented with 10% heat-inactivated FBS. The cells
were grown to a maximum 80-90% confluence, which entailed
splitting every 3-4 days with a seeding density of about 3×10⁶
cells per T-175 cm² flask.

The BT-20 assay uses the inclusion of a cellular
stain, crystal violet, as a detection method to assess survival
of cells after treatment with TNF. Dead cells are unable to
take up and retain the dye.

In brief, the protocol used for the assay of anti-TNF
activity is the following. Recombinant human TNFα (R&D
Systems) and the experimental samples are constituted in media
(RPMI 1640 with 5% heat-inactivated FBS) and added to the wells
of 96-well culture plates. The cells are then plated into
these wells at a density of 1×10⁵ cells/well. The quantity of
TNFα added was determined earlier in titration studies, and
represents a dose at which about 50% of the cells are killed.

After addition of the samples, the cells are cultured
for 48 hours at 39°C, after which the proportion of live cells
is determined using crystal violet staining and a microtiter
plate reader (570 nm).



RESULTS



1. Constructs under study 

The designs of the hybrid proteins studied are
briefly summarized below; two control proteins, a monomeric
soluble p55 (r-hTBP-1) and a dimeric TBP-immunoglobulin fusion
protein (TBP-IgG3) (prepared essentially as described in (10)),
were studied for comparative purposes.

Construct          TBP N-term        TBP C-term   Fusion partner
r-hTBP-1           mix of 9 and 20   180          none
TBP-IgG3           mix of 9 and 20   190          IgG3 heavy chain constant region
TBP(20-161)-hCG    20                161          hCGα and hCGβ (heterodimer)
TBP(20-190)-hCG    20                190          hCGα and hCGβ (heterodimer)

The sequences of the DNAs encoding TBP(20-190)-hCG
and TBP(20-161)-hCG are provided in Figures 1 and 2,
respectively. A schematic summary of the constructs is
provided in Figure 3.

2. Secretion of TBP-hCG proteins

All of the constructs tested were found to be
produced and secreted into culture media by transfected
mammalian cells. Data illustrating this are shown in Tables 1
and 2.

3. TBP-hCG (α/β) fusion proteins assemble into
heterodimers

The combination of TBP-hCGα and TBP-hCGβ was
confirmed using the sandwich assay for the hCG heterodimer.
Only the combined transfection of α and β subunit fusions
resulted in heterodimer detection (Table 3).

4. TBP-hCG hybrid proteins exhibit increased
activity over TBP monomer

Hybrid proteins produced in either COS-7 or CHO cells
were found to be potent inhibitors of TNFα in the BT-20
bioassay. Some of the samples tested are summarized in Table
4.

Negative controls (conditioned media from mock
transfections) were included for the 1x media samples.

As illustrated in Figures 4-6 (points on y-axis),
addition of TNF (2.5 ng/ml) results in a clear reduction in
live cell number (as assessed by OD 570). In every case,
active samples have as a maximal protective effect the
restoration of cell viability to the level seen in the absence
of added TNF (i.e., the control labeled "cells alone").

The positive controls, r-hTBP-1 and TBP-IgG3, are
both protective, showing a clear dose-dependence and ED50s of
approximately 100 ng/ml for r-hTBP-1 (Figs. 4-6) and about
1.5 ng/ml for TBP-IgG3 (Fig. 4), respectively.

The TBP-hCG constructs from 1x media (CHO or COS) or
from the immunopurification show dose-dependent protection,
with approximate ED50s ranging from 2-11 ng/ml (Figs. 4-6).

The results from the in vitro bioassay are reported 
in Table 5. The data indicate that the hybrid proteins inhibit 
TNF cytotoxicity, and that they are substantially more potent 
than the TBP monomer. The negative controls were devoid of 
protective activity. 
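
The potency comparison can be made explicit as a ratio of ED50 values. The sketch below uses only the approximate ED50 figures reported in Table 5 (ng/ml, as quantified by the TBP ELISA); the dictionary and function names are illustrative, and the ratios are on the reported concentration basis, not a molar basis.

```python
# Illustrative fold-potency calculation from the ED50 values in Table 5.
# Each entry is a (low, high) ED50 range in ng/ml; single values use low == high.

ED50_NG_PER_ML = {
    "r-hTBP-1": (100.0, 100.0),
    "TBP-IgG3": (1.5, 1.5),
    "TBP(20-161)-hCG": (2.0, 2.0),
    "TBP(20-190)-hCG": (8.0, 11.0),
}


def fold_potency_vs_monomer(construct):
    """Return the (min, max) fold increase in potency relative to r-hTBP-1."""
    ref_low, ref_high = ED50_NG_PER_ML["r-hTBP-1"]
    low, high = ED50_NG_PER_ML[construct]
    # A lower ED50 means higher potency, so the ratio is reference / sample.
    return ref_low / high, ref_high / low


for name in ED50_NG_PER_ML:
    fold_min, fold_max = fold_potency_vs_monomer(name)
    print(f"{name}: {fold_min:.1f}- to {fold_max:.1f}-fold")
```

On these figures TBP(20-161)-hCG is about 50-fold and TBP(20-190)-hCG about 9- to 12.5-fold more potent than the monomer, compared with roughly 67-fold for TBP-IgG3.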

In addition to the possibility that dimerization of
TBP may increase potency, it is also possible that the activity
of the hybrid proteins is not related to dimeric interaction
with TBP, but rather to steric inhibition due to the partner of
the hybrid interfering with soluble TBP/TNF binding to
cell-surface TNF receptors.

All references cited herein, including journal
articles or abstracts, published or corresponding U.S. or
foreign patent applications, issued U.S. or foreign patents, or
any other references, are entirely incorporated by reference
herein, including all data, tables, figures, and text presented
in the cited references. Additionally, the entire contents of
the references cited within the references cited herein are
also entirely incorporated by reference.

Reference to known method steps, conventional method
steps, known methods or conventional methods is not in any way
an admission that any aspect, description or embodiment of the
present invention is disclosed, taught or suggested in the
relevant art.

The foregoing description of the specific embodiments
will so fully reveal the general nature of the invention that
others can, by applying knowledge within the skill of the art
(including the contents of the references cited herein),
readily modify and/or adapt for various applications such
specific embodiments, without undue experimentation, without
departing from the general concept of the present invention.
Therefore, such adaptations and modifications are intended to
be within the meaning and range of equivalents of the disclosed
embodiments, based on the teaching and guidance presented
herein. It is to be understood that the phraseology or
terminology herein is for the purpose of description and not of
limitation, such that the terminology or phraseology of the
present specification is to be interpreted by the skilled
artisan in light of the teachings and guidance presented
herein, in combination with the knowledge of one of ordinary
skill in the art.



TABLES

Table 1: COS-7 transient expression (TBP ELISA)

Hybrid protein        Concentration (pg/ml)
TBP1                  66
TBP-hCGα(20-161)      5.1
TBP-hCGβ(20-161)      0.5
TBP-hCG(20-161)       2.7
control               <0.25

Constructs were expressed using pSVL (Pharmacia)



Table 2: COS-7 transient expression (TBP ELISA)

Hybrid protein        Concentration (ng/ml)
TBP1                  131
TBP-hCGα(20-190)      81
TBP-hCGβ(20-190)      9
TBP-hCG(20-190)       62
control               <1

Constructs were expressed using a mouse
metallothionein promoter-containing vector - pDα






PCT/US97/02315



Table 3: COS-7 transient expression
(hCG heterodimer assay)

Hybrid protein        Concentration (ng/ml)
TBP1                  <0.2
TBP-hCGα(20-190)      <0.2
TBP-hCGβ(20-190)      <0.2
TBP-hCG(20-190)       38
control               <0.2

Constructs were expressed using a mouse
metallothionein promoter-containing vector - pDα



Table 4: Samples tested for anti-TNF activity

Construct          Cell source   Nature of sample
r-hTBP-1           CHO           purified
TBP-IgG3           CHO           1x conditioned media
TBP(20-161)-hCG    CHO           immunopurified (anti-TBP)
TBP(20-190)-hCG    CHO           1x conditioned media
TBP(20-190)-hCG    COS           1x conditioned media

Table 5: Preliminary Assessment of the hybrid proteins in TNF
Cytotoxicity Assay

Construct          Fusion partner                      Anti-TNF activity (ED50)
                                                       in BT-20 bioassay*
r-hTBP-1           none                                100 ng/ml
TBP-IgG3           IgG3 heavy chain constant region    1.5 ng/ml
TBP(20-161)-hCG    hCGα and hCGβ (heterodimer)         2 ng/ml
TBP(20-190)-hCG    hCGα and hCGβ (heterodimer)         8-11 ng/ml

*The quantitation of material for dosing and estimation of ED50 was made using
the TBP ELISA.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Applied Research Systems ARS Holding N.V. 

(B) STREET: 14 John B. Gorsiraweg 

(C) CITY: Curacao 

(E) COUNTRY: Netherlands Antilles 

(F) POSTAL CODE (ZIP) : 

(A) NAME: CAMPBELL, Robert C. 

(B) STREET: 25 Meadowbrook Drive 

(C) CITY: Wrentham 

(E) STATE: Massachusetts 

(F) COUNTRY : United States of America 

(A) NAME: JAMESON, Bradford A. 

(B) STREET: 76 Robbins Street 

(C) CITY : Milton 

(E) STATE: Massachusetts 

(F) COUNTRY: United States of America 

(A) NAME: CHAPPEL, Scott C. 

(B) STREET: 125 Canton Avenue 

(C) CITY: Milton 

(E) STATE : Massachusetts 

(F) COUNTRY: United States of America 

(ii) TITLE OF INVENTION: HYBRID PROTEINS 
(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: BROWDY AND NEIMARK 

(B) STREET: 419 Seventh Street N.W., Ste. 300 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA
(F) ZIP: 22207

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/011,936 

(B) FILING DATE: 20 February 1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Browdy, Roger L.
(B) REGISTRATION NUMBER: 25,616
(C) REFERENCE/DOCKET NUMBER: CAMPBELL=2A PCT

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 628-5197 

(B) TELEFAX: (202) 737-3528 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 278..1047

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCCACATGGC TACAGGTAAG CGCCCCTAAA ATCCCTTTGG GCACAATGTG TCCTGAGGGG 60 

AGAGGCAGCG ACCTGTAGAT GGGACGGGGG CACTAACCCT CAGGTTTGGG GCTTCTCAAT 120 

CTCACTATCG CCATGTAAGC CCAGTATTTG GCCAATCTCA GAAAGCTCCT CCTCCCTGGA 180 

GGGATGGAGA GAGAAAAACA AACAGCTCCT GGAGCAGGGA GAGTGCTGGC CTCTTGCTCT 240 

CCGGCTCCCT CTGTTGCCCT CTGGTTTCTC CCCAGGC TCC CGG ACG TCC CTG CTC 295 

Ser Arg Thr Ser Leu Leu 
1 5 

CTG GCT TTT GGC CTG CTC TGC CTG CCC TGG CTT CAA GAG GGC AGT GCC 343 
Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp Leu Gln Glu Gly Ser Ala
10 15 20 

GAT AGT GTG TGT CCC CAA GGA AAA TAT ATC CAC CCT CAA AAT AAT TCC 391 
Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile His Pro Gln Asn Asn Ser
25 30 35 

ATT TGC TGT ACC AAG TGC CAC AAA GGA ACC TAC TTG TAC AAT GAC TGT 439 
Ile Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys
40 45 50 

CCA GGC CCG GGG CAG GAT ACG GAC TGC AGG GAG TGT GAG AGC GGC TCC 487 
Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser
55 60 65 70 

TTC ACC GCT TCA GAA AAC CAC CTC AGA CAC TGC CTC AGC TGC TCC AAA 535 
Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys 
75 80 85 

TGC CGA AAG GAA ATG GGT CAG GTG GAG ATC TCT TCT TGC ACA GTG GAC 583 
Cys Arg Lys Glu Met Gly Gln Val Glu Ile Ser Ser Cys Thr Val Asp
90 95 100 

CGG GAC ACC GTG TGT GGC TGC AGG AAG AAC CAG TAC CGG CAT TAT TGG 631 
Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gln Tyr Arg His Tyr Trp
105 110 115 

AGT GAA AAC CTT TTC CAG TGC TTC AAT TGC AGC CTC TGC CTC AAT GGG 679 
Ser Glu Asn Leu Phe Gln Cys Phe Asn Cys Ser Leu Cys Leu Asn Gly
120 125 130 

ACC GTG CAC CTC TCC TGC CAG GAG AAA CAG AAC ACC GTG TGC ACC TGC 727 
Thr Val His Leu Ser Cys Gln Glu Lys Gln Asn Thr Val Cys Thr Cys
135 140 145 150 

CAT GCA GGT TTC TTT CTA AGA GAA AAC GAG TGT GTC TCC TGT GCC GGT 775 
His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ala Gly 
155 160 165 

GCT GCC CCA GGT TGC CCA GAA TGC ACG CTA CAG GAA AAC CCA TTC TTC 823 
Ala Ala Pro Gly Cys Pro Glu Cys Thr Leu Gln Glu Asn Pro Phe Phe
170 175 180

TCC CAG CCG GGT GCC CCA ATA CTT CAG TGC ATG GGC TGC TGC TTC TCT 871
Ser Gln Pro Gly Ala Pro Ile Leu Gln Cys Met Gly Cys Cys Phe Ser
185 190 195 

AGA GCA TAT CCC ACT CCA CTA AGG TCC AAG AAG ACG ATG TTG GTC CAA 919 
Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val Gln
200 205 210 

AAG AAC GTC ACC TCA GAG TCC ACT TGC TGT GTA GCT AAA TCA TAT AAC 967 
Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn 
215 220 225 230 



AGG GTC ACA GTC ATG GGG GGT TTC AAA GTG GAG AAC CAC ACG GGG TGC 
Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr Gly Cys 
235 240 245 

CAC TGC AGT ACT TGT TAT TAT CAC AAA TCT TA AG 
His Cys Ser Thr Cys Tyr Tyr His Lys Ser 
250 255 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp 
1 5 10 15

Leu Gln Glu Gly Ser Ala Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile
20 25 30 

His Pro Gln Asn Asn Ser Ile Cys Cys Thr Lys Cys His Lys Gly Thr
35 40 45 

Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg
50 55 60 

Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu Arg His 
65 70 75 80 

Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gln Val Glu Ile
85 90 95

Gln Glu Asn Pro Phe Phe Ser Gln Pro Gly Ala Pro Ile Leu Gln Cys
180 185 190 

Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys 
195 200 205 

Lys Thr Met Leu Val Gln Lys Asn Val Thr Ser Glu Ser Thr Cys Cys
210 215 220 

Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val 
225 230 235 240 

Glu Asn His Thr Gly Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser 
245 250 255 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1202 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 279..1199

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

CTCGAGATGG CTACAGGTAA GCGCCCCTAA AATCCCTTTG GGCACAATGT GTCCTGAGGG 60 

GAGAGGTAGC GACCTGTAGA TGGGACGGGG GCACTAACCC TGAGGTTTGG GGCTTCTGAA 120 

TGTGAGTATC GCCATGTAAG CCCAGTATTT GGCCAATGTC AGAAAGCTCC TGGTCCCTGG 180 

AGGGATGGAG AGAGAAAAAC AAACAGCTCC TGGAGCAGGG AGAGTGCTGG CCTCTTGCTC 240 

TCCGGCTCCC TCTGTTGCCC TGTGGTTTCT CCCCAGGC TCC CGG ACG TCC CTG 293 

Ser Arg Thr Ser Leu 
260 

CTC CTG GCT TTT GGC CTG CTC TGC CTG CCC TGG CTT CAA GAG GGC AGT 341 
Leu Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp Leu Gln Glu Gly Ser
265 270 275 

GCC GAT AGT GTG TGT CCC CAA GGA AAA TAT ATC CAC CCT CAA AAT AAT 389 
Ala Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile His Pro Gln Asn Asn
280 285 290 

TCG ATT TGC TGT ACC AAG TGC CAC AAA GGA ACC TAC TTG TAC AAT GAC 437 
Ser Ile Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp
295 300 305 

TGT CCA GGC CCG GGG CAG GAT ACG GAC TGC AGG GAG TGT GAG AGC GGC 485 
Cys Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly
310 315 320 325 

TCT TTC ACC GCT TCA GAA AAC CAC CTC AGA CAC TGC CTC AGC TGC TCC 533 
Ser Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser 
330 335 340 

AAA TGC CGA AAG GAA ATG GGT CAG GTG GAG ATC TCT TCT TGC ACA GTG 581 
Lys Cys Arg Lys Glu Met Gly Gln Val Glu Ile Ser Ser Cys Thr Val
345 350 355

GAC CGG GAC ACC GTG TGT GGC TGC AGG AAG AAC CAG TAC CGG CAT TAT
Asp Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gln Tyr Arg His Tyr
360 365 370 

TGG AGT GAA AAC CTT TTC CAG TGC TTC AAT TGC AGC CTC TGC CTC AAT 
Trp Ser Glu Asn Leu Phe Gln Cys Phe Asn Cys Ser Leu Cys Leu Asn
375 380 385 

GGG ACC GTG CAC CTC TCC TGC CAG GAG AAA CAG AAC ACC GTG TGC ACC 
Gly Thr Val His Leu Ser Cys Gln Glu Lys Gln Asn Thr Val Cys Thr
390 395 400 405 

TGC CAT GCA GGT TTC TTT CTA AGA GAA AAC GAG TGT GTC TCC TGT GCT 
Cys His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ala 
410 415 420 

GGT GCT GGT CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG 
Gly Ala Gly Pro Arg Cys Arg Pro Ile Asn Ala Thr Leu Ala Val Glu
425 430 435 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 
Lys Glu Gly Cys Pro Val Cys Ile Thr Val Asn Thr Thr Ile Cys Ala
440 445 450 

GGC TAC TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTC CCC GCC 
Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gln Gly Val Leu Pro Ala
455 460 465 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 
Leu Pro Gln Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser Ile
470 475 480 485 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCT 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
490 495 500 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC 
Val Ala Leu Ser Cys Gln Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp
505 510 515 

TGC GGG GGT CCC AAG GAC CAC CCC TTG ACC TGT GAT GAC CCC CGC TTC 
Cys Gly Gly Pro Lys Asp His Pro Leu Thr Cys Asp Asp Pro Arg Phe 
520 525 530 

CAG GAC TCC TCT TCC TCA AAG GCC CCT CCC CCC AGC CTT CCA AGC CCA 
Gln Asp Ser Ser Ser Ser Lys Ala Pro Pro Pro Ser Leu Pro Ser Pro
535 540 545 

TCC CGA CTC CCG GGG CCC TCG GAC ACC CCG ATC CTC CCA CAA TAA 
Ser Arg Leu Pro Gly Pro Ser Asp Thr Pro Ile Leu Pro Gln
550 555 560

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp 
1 5 10 15

Leu Gln Glu Gly Ser Ala Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile
20 25 30 

His Pro Gln Asn Asn Ser Ile Cys Cys Thr Lys Cys His Lys Gly Thr
35 40 45 

Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg
50 55 60 

Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu Arg His 
65 70 75 80 

Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gln Val Glu Ile
85 90 95 



Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu Asn Glu 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1147 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 278..1132

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:



TCGAGATGGC TACAGGTAAG CGCCCCTAAA ATCCCTTTGG GCACAATGTG TCCTGAGGGG 60 

AGAGGCAGCG ACCTGTAGAT GGGACGGGGG CACTAACCCT CAGGTTTGGG GCTTTTGAAT 120 

GTGAGTATGG CCATGTAAGC CCAGTATTTG CCCAATCTCA GAAAGCTCCT GGTCCCTGGA 180 

GGGATGGAGA GAGAAAAACA AACAGCTCCT GGAGCAGGGA CACTCCTGGC CTCTTGCTCT 240

GCGGCTCCGT GTGTTGCCCT GTGGTTTCTC CCCACGC TCC CGG ACG TCC CTG CTC 295 

Ser Arg Thr Ser Leu Leu 
310 

CTG GCT TTT GGC CTG CTC TGC CTG CCC TGG CTT CAA GAG GGC AGT GCC 343 
Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp Leu Gln Glu Gly Ser Ala
315 320 325 

GAT AGT GTG TGT CCC CAA GGA AAA TAT ATC CAC CCT CAA AAT AAT TCG 391 
Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile His Pro Gln Asn Asn Ser
330 335 340 345 

ATT TGC TGT ACC AAG TGC CAC AAA GGA ACC TAC TTG TAC AAT GAC TGT 439
Ile Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys
350 355 360 

CCA GGC CCG GGG CAG GAT ACC GAC TGC AGG GAG TGT GAG AGC GGC TCC 487 
Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser
365 370 375 

TTC ACC GCT TCA GAA AAC CAC CTC AGA CAC TGC CTC AGC TGC TCC AAA 535 
Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys 
380 385 390 

TGC CGA AAG GAA ATG GGT CAG GTG GAG ATC TCT TCT TGC ACA GTG GAC 583 
Cys Arg Lys Glu Met Gly Gln Val Glu Ile Ser Ser Cys Thr Val Asp
395 400 405 

CGG GAC ACC GTG TGT GGC TGC AGG AAG AAC CAG TAC CGG CAT TAT TGG 631 
Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gln Tyr Arg His Tyr Trp
410 415 420 425

AGT GAA AAC CTT TTC CAG TGC TTC AAT TGC ACC CTC TGC CTC AAT GGG 679 
Ser Glu Asn Leu Phe Gln Cys Phe Asn Cys Thr Leu Cys Leu Asn Gly
430 435 440 

ACC GTG CAC CTC TCC TGT CAG GAG AAA CAG AAC ACC GTC TGC ACC TGC 727 
Thr Val His Leu Ser Cys Gln Glu Lys Gln Asn Thr Val Cys Thr Cys
445 450 455 

CAT GCA GGT TTC TTT CTA AGA GAA AAC GAG TGT GTC TCC TGT AGT AAC 775 
His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ser Asn 
460 465 470 

TGT AAG AAA AGC CTG GAG TGC ACG AAG TTG TCC CTA CCC CAG ATT GAG 823 
Cys Lys Lys Ser Leu Glu Cys Thr Lys Leu Ser Leu Pro Gln Ile Glu
475 480 485 

AAT GTT AAG GGC ACT GAG GAC TCA GGC ACC ACA GCC GGT GCT GCC CCA 871 
Asn Val Lys Gly Thr Glu Asp Ser Gly Thr Thr Ala Gly Ala Ala Pro 
490 495 500 505

GGT TGC CCA GAA TGC ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG 919
Gly Cys Pro Glu Cys Thr Leu Gln Glu Asn Pro Phe Phe Ser Gln Pro
510 515 520

GGT GCC CCA ATA CTT CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT 967
Gly Ala Pro Ile Leu Gln Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr
525 530 535

CCC ACT CCA CTA AGG TCC AAG AAG ACG ATG TTG GTC CAA AAG AAC GTC 1015
Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val Gln Lys Asn Val
540 545 550

ACC TCA GAG TCC ACT TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA 1063
Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr
555 560 565

GTA ATG GGG GGT TTC AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT 1111
Val Met Gly Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser
570 575 580 585

ACT TGT TAT TAT CAC AAA TCT TAAGGATCCC TCGAG 1147
Thr Cys Tyr Tyr His Lys Ser
590 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp
1 5 10 15

Leu Gln Glu Gly Ser Ala Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile
20 25 30

His Pro Gln Asn Asn Ser Ile Cys Cys Thr Lys Cys His Lys Gly Thr
35 40 45

Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg
50 55 60

Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu Arg His
65 70 75 80

Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gln Val Glu Ile
85 90 95 



Gln Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gln Cys Phe Asn Cys
115 120 125

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 279..1287

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTCGAGATGG CTACAGGTAA GCGCCCCTAA AATCCCTTTG GGCACAATGT GTCCTGAGGG 60 

GAGAGGCAGC GACCTGTAGA TGGGACGGGG GCACTAACCC TCAGGTTTGG GGCTTCTGAA 120 

TGTGAGTATC GCCATGTAAG CCCAGTATTT GGCCAATGTC AGAAAGCTCC TGGTCCCTGG 180 

AGGGATGGAG AGAGAAAAAC AAACACCTCC TGGAGCAGGG AGAGTGCTGC CCTCTTGCTC 240 

TCCGGCTCCC TCTGTTGCCC TCTGGTTTCT CCCCAGGC TCC CGG ACG TCC CTG 293 

Ser Arg Thr Ser Leu 
290 

CTC CTG GCT TTT GGC CTG CTC TGC CTG CCC TGG CTT CAA GAG GGC AGT 341 
Leu Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp Leu Gln Glu Gly Ser
295 300 305

GCC GAT AGT GTG TGT CCC CAA GGA AAA TAT ATC CAC CCT CAA AAT AAT 389
Ala Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile His Pro Gln Asn Asn
310 315 320

TCG ATT TGC TGT ACC AAG TGC CAC AAA GGA ACC TAC TTG TAC AAT GAC 437
Ser Ile Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp
325 330 335

TGT CCA GGC CCG GGG CAG GAT ACG GAC TGC AGG GAG TGT GAG AGC GGC 485
Cys Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly
340 345 350

TCC TTC ACC GCT TCA GAA AAC CAC CTC AGA CAC TGC CTC AGC TGC TCC 533
Ser Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser 
355 360 365 370 

AAA TGC CGA AAG GAA ATG GGT CAG GTG GAG ATC TCT TCT TGC ACA GTG 581 
Lys Cys Arg Lys Glu Met Gly Gln Val Glu Ile Ser Ser Cys Thr Val
375 380 385

GAC CGG GAC ACC GTG TGT GGC TGC AGG AAG AAC CAG TAC CGG CAT TAT 629
Asp Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gln Tyr Arg His Tyr
390 395 400

TGG AGT GAA AAC CTT TTC CAG TGC TTC AAT TGC AGC CTC TGC CTC AAT 677
Trp Ser Glu Asn Leu Phe Gln Cys Phe Asn Cys Ser Leu Cys Leu Asn
405 410 415

GGG ACC GTG CAC CTC TCC TGC CAG GAG AAA CAG AAC ACC GTG TGC ACC 725
Gly Thr Val His Leu Ser Cys Gln Glu Lys Gln Asn Thr Val Cys Thr
420 425 430 

TGC CAT GCA GGT TTC TTT CTA AGA GAA AAC GAG TGT GTC TCC TGT AGT 773 
Cys His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ser 
435 440 445 450 

AAC TGT AAG AAA AGC CTG GAG TGC ACG AAG TTG TGC CTA CCC CAG ATT 821 
Asn Cys Lys Lys Ser Leu Glu Cys Thr Lys Leu Cys Leu Pro Gln Ile
455 460 465 

GAG AAT GTT AAG GGC ACT GAG GAC TCA GGC ACC ACA GCT GGT GCT GGT 869 
Glu Asn Val Lys Gly Thr Glu Asp Ser Gly Thr Thr Ala Gly Ala Gly 
470 475 480 

CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG AAG GAG GGC 917 
Pro Arg Cys Arg Pro Ile Asn Ala Thr Leu Ala Val Glu Lys Glu Gly
485 490 495 

TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC GGC TAC TGC 965 
Cys Pro Val Cys Ile Thr Val Asn Thr Thr Ile Cys Ala Gly Tyr Cys
500 505 510 

CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC CTG CCT CAG 1013 
Pro Thr Met Thr Arg Val Leu Gln Gly Val Leu Pro Ala Leu Pro Gln
515 520 525 530 

GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC CGG CTC CCT 1061 
Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser Ile Arg Leu Pro
535 540 545 

GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC GTG GCT CTC 1109 
Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala Val Ala Leu 
550 555 560 

AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC TGC GGG GGT 1157 
Ser Cys Gln Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp Cys Gly Gly
565 570 575 

CCC AAG GAC CAC CCC TTG ACC TGT GAT GAC CCC CGC TTC CAG GAC TCC 1205 
Pro Lys Asp His Pro Leu Thr Cys Asp Asp Pro Arg Phe Gln Asp Ser
580 585 590

TCT TCC TCA AAG GCC CCT CCC CCC AGC CTT CCA AGC CCA TCC CGA CTC 1253 
Ser Ser Ser Lys Ala Pro Pro Pro Ser Leu Pro Ser Pro Ser Arg Leu 
595 600 605 610 

CCG GGG CCC TCG GAC ACC CCG ATC CTC CCA CAA T AAGGATCCCT CGAG 1301 
Pro Gly Pro Ser Asp Thr Pro Ile Leu Pro Gln
615 620

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu Cys Leu Pro Trp 
1 5 10 15

Leu Gln Glu Gly Ser Ala Asp Ser Val Cys Pro Gln Gly Lys Tyr Ile
20 25 30

His Pro Gln Asn Asn Ser Ile Cys Cys Thr Lys Cys His Lys Gly Thr
35 40 45

Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gln Asp Thr Asp Cys Arg
50 55 60

Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu Arg His
65 70 75 80

Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gln Val Glu Ile
85 90 95 



WO 97/30161 PCT/US97/02315

Thr Asp Cys Gly Gly Pro Lys Asp His Pro Leu Thr Cys Asp Asp Pro 
290 295 300 

Arg Phe Gln Asp Ser Ser Ser Ser Lys Ala Pro Pro Pro Ser Leu Pro
305 310 315 320 

Ser Pro Ser Arg Leu Pro Gly Pro Ser Asp Thr Pro Ile Leu Pro Gln
325 330 335 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ala Gly Ala Ala Pro Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Ala Gly Ala Gly



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
TTTTCTCGAG ATGGCTACAG GTAAGCGCCC

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
ACCTGGGGCA GCACCGGCAC AGGAGACACA CTCGTTTTC

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGTGCCGGTG CTGCCCCAGG TTGCCCAGAA TGCACGCTAC AG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTTTGGATCC TTAAGATTTG TGATAATAAC AAGTAC 
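
The oligonucleotide of SEQ ID NO: 14 can be checked against the 3' end of SEQ ID NO: 5 by taking its reverse complement. The following is a minimal sketch, not part of the original listing. The helper name `reverse_complement`, the variable names, and the treatment of the four leading T residues as a non-annealing 5' tail are assumptions made for illustration only.

```python
# Hypothetical helper: reverse complement of an unambiguous DNA string.
def reverse_complement(seq: str) -> str:
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq))

# SEQ ID NO: 14 as listed above, spaces removed.
primer_14 = "TTTTGGATCCTTAAGATTTGTGATAATAACAAGTAC"

# Last 42 nucleotides of SEQ ID NO: 5 (positions 1106-1147), spaces removed.
seq5_3prime = "TGCAGTACTTGTTATTATCACAAATCTTAAGGATCCCTCGAG"

rc = reverse_complement(primer_14)

# Assumption: the 5' TTTT of the primer is a tail that does not anneal,
# so it is dropped from the 3' end of the reverse complement.
annealing = rc[:-4]

print(rc)
print(annealing in seq5_3prime)
```

Under these assumptions the annealing portion spans the last codons of the coding region, the TAA stop codon, and the GGATCC (BamHI) site of SEQ ID NO: 5.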

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
CCGTGGACCA GCACCAGCAC AGGAGACACA CTCGTTTTC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TGTGCTGGTG CTGGTCCACG GTGCCGCCCC ATCAAT 
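
The oligonucleotides of SEQ ID NOs: 13 and 16 appear to carry, in frame, the codons for the linker peptides of SEQ ID NOs: 9 and 10. The following is a minimal sketch of that check, not part of the original listing. It assumes the standard genetic code and a reading frame that starts at the first base of each oligonucleotide; `CODON_TABLE` and `translate` are hypothetical helper names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard code applies to these constructs).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# SEQ ID NO: 13 and SEQ ID NO: 16 as listed above, spaces removed.
primer_13 = "TGTGCCGGTGCTGCCCCAGGTTGCCCAGAATGCACGCTACAG"
primer_16 = "TGTGCTGGTGCTGGTCCACGGTGCCGCCCCATCAAT"

# Linker peptides in one-letter code:
# SEQ ID NO: 9 = Ala Gly Ala Ala Pro Gly, SEQ ID NO: 10 = Ala Gly Ala Gly.
linker_9 = "AGAAPG"
linker_10 = "AGAG"

print(translate(primer_13), linker_9 in translate(primer_13))
print(translate(primer_16), linker_10 in translate(primer_16))
```

In both translations the linker residues follow a leading Cys codon and precede residues that match the hCG subunit portions of SEQ ID NOs: 5 and 7, respectively.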

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTTTGGATCC TTATTGTGGG AGGATCGGGG TG 32 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
TTTTAGATCT CTTCTTGCAC AGTGGAC 27

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
TGTGGTGCCT GAGTCCTCAG T 21 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
ACTGAGGACT CAGGCACCAC AGCCGGTGCT GCCCCAGGTT G 41 

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
TTTTTCTAGA GAAGCAGCAG CAGCCCATG 29

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
TTTTCCACAG CCAGGGTGGC ATTGATGGGG CGGCACCGTG GACCAGCACC AGCTGTGGTG 
CCTGAGTCCT CAGTG

CLAIMS

1. A hybrid protein comprising two coexpressed
amino acid sequences forming a dimer, each comprising: 

a) at least one amino acid sequence selected from 
the group consisting of a homomeric receptor, a chain of a 
heteromeric receptor, a ligand, and fragments thereof which 
retain the ligand-receptor binding capability; and

b) a subunit of a heterodimeric proteinaceous 
hormone, or fragments thereof which retain the ability of the 
subunit to form a heterodimer with other subunits thereof;

wherein sequences (a) and (b) are bonded directly or 
through a peptide linker, and in which the sequence (b) in each 
of said two coexpressed sequences are capable of aggregating to 
form a dimer complex. 

2. A hybrid protein in accordance with claim 1, 
wherein said sequence (a) is selected from the group consisting 
of TBP1, TBP2 or fragments thereof still containing the ligand 
binding domain; the extracellular domain of the IFNα/β
receptor or the IFN-γ receptor; a gonadotropin receptor or
extracellular fragments thereof; antibody light chains or
fragments thereof, optionally associated with the respective
heavy chains; antibody heavy chains or fragments thereof;
antibody Fab domains; and IL-6, IFN-β, TPO or fragments
thereof.

3. A hybrid protein in accordance with claim 1, 
wherein said sequence (b) is selected from the group consisting 
of subunits of hCG, FSH, LH, TSH or inhibin, and fragments 
thereof.

4. A hybrid protein in accordance with claim 1, 
wherein sequence (a) is linked to the amino terminus of 
sequence (b).

5. A hybrid protein in accordance with claim 1,
wherein sequence (a) is linked to the carboxy terminus of
sequence (b).

6. A hybrid protein in accordance with claim 1, 
wherein said two coexpressed amino acid sequences each include 
the sequence for TBP1 or the fragment thereof corresponding to 
amino acid residues 20-161 or 20-190 of TBP1, as sequence (a) 
and the respective α and β subunits of hCG or fragments
thereof, as sequence (b).

7. A hybrid protein in accordance with claim 1, 
wherein said two coexpressed amino acid sequences each include 
the extracellular domain of a gonadotropin receptor as sequence 
(a) and the respective α and β subunits of a gonadotropin as
sequence (b).

8. A hybrid protein in accordance with claim 7, 
wherein said sequence (a) is the FSH receptor extracellular 
domain and sequence (b) is a subunit of FSH. 

9. A hybrid protein in accordance with claim 7, 
wherein said sequences (a) and (b) are linked with a peptide 
linker. 

10. A hybrid protein in accordance with claim 9, 
wherein said peptide linker has an enzyme cleavage site. 

11. A hybrid protein in accordance with claim 10, 
wherein said enzyme cleavage site is a thrombin cleavage site. 

12. A hybrid protein in accordance with claim 10, 
wherein said enzyme cleavage site is recognized and cleaved by 
an enzyme which is found in the ovary. 

13. A hybrid protein in accordance with claim 9, 
wherein said peptide linker serves as a flexible hinge.

14. A hybrid protein in accordance with claim 1,
wherein one or more covalent bonds between the two subunits (b) 
are added. 

15. A DNA molecule encoding a hybrid protein in 
accordance with claim 1.

16. An expression vector containing a DNA molecule
in accordance with claim 15.

17. A host cell containing an expression vector in 
accordance with claim 16 and capable of expressing said hybrid 
protein. 

18. A method for producing hybrid protein comprising 
culturing a host cell in accordance with claim 17 and 
recovering the hybrid protein expressed thereby. 

19. A pharmaceutical composition comprising a hybrid 
protein in accordance with claim 1 and a pharmaceutically 
acceptable carrier and/or excipient. 

20. A method for inducing follicular maturation, 
comprising administering a pharmaceutical composition 
comprising the hybrid protein of claim 8 to a subject in need
thereof.

[Figure: nucleotide sequence of the construct with the deduced amino acid sequence beneath it. Labeled features: hGH signal sequence, hGH intron, +20 Asp of processed TBP1, linker, +7 Cys of hCG alpha.]

Figure 1(a)
TBP(20-161)-hCGα FUSION CONSTRUCT

[Figure: nucleotide sequence of the construct with the deduced amino acid sequence beneath it. Labeled features: linker, +7 Pro of hCG beta.]

Figure 1(b)
TBP(20-161)-hCGβ FUSION CONSTRUCT

[Figure: nucleotide sequence of the construct with the deduced amino acid sequence beneath it. Labeled features: linker, +7 Cys of hCG alpha, BamHI and XhoI sites.]

Figure 2(a)
TBP(20-190)-hCGα FUSION CONSTRUCT

[Figure: map labels BamHI, SV40 E, TBP190hCG β, 3' splice acceptor, BamHI, EcoRI, followed by the nucleotide sequence of the construct with the deduced amino acid sequence beneath it. Labeled sequence features: +20 Asp of processed TBP1, BamHI and XhoI sites.]

Figure 2(b)
TBP(20-190)-hCGβ FUSION CONSTRUCT